19
Task 1.2
An important task of bioinformatics is the collection and management of data and the
provision of helpful tools. Name and describe two databases containing information on,
for example, genes and gene expression datasets.
Task 1.3
Example:
The MEDLINE database (also known as PubMed) is a large, worldwide open library
about medicine and biology. Here you can find publications and sequences as well as a lot of
other information and links. So PubMed is a good first entry site to use when starting a search.
Familiarize yourself with the PubMed database (https://www.ncbi.nlm.nih.gov/pubmed) and
find out about the artificial sequence for the “TAR protein”. Hint: Search with “synthetic”, all
searches are in English after all; the search is only limited enough by keywords if only one
sequence is found by the query. Only then can you clearly answer the following questions.
1. Which of the following statements about sequence length (amino acid = aa) is
correct?
A. The protein sequence is 267 aa long.
B. The protein sequence is 367 aa long.
C. The protein sequence is 276 aa long.
D. The protein sequence is 376 aa long.
2. Which of the following statements about the title is correct?
A. The sequence was filed under the title “Cloning of human full-length CDS in
Creator (TM) recombinational vector system” in PubMed.
B. The sequence has been filed under the title “Uploading of human full-length
CDS” in PubMed.
C. The sequence has been filed under the title “Uploading of recombinational
vector system” in PubMed.
D. The sequence has been filed under the title “Cloning of recombinational vec
tor system” in PubMed.
3. Which of the following statements is correct?
A. Hines et al. submitted the sequence to the journal Biological Chemistry and
Molecular Pharmacology, Harvard Institute of Proteomics on 05-JAN-2015.
B. Darwin et al. submitted the sequence to the journal Biological Chemistry and
Molecular Pharmacology, Harvard Institute of Proteomics on 05-JAN-2005.
C. Hines et al. submitted the sequence to the journal Biological Chemistry and
Molecular Pharmacology, Harvard Institute of Proteomics on 05-MAR-2005.
D. Hines et al. submitted the sequence to the journal Biological Chemistry and
Molecular Pharmacology, Harvard Institute of Proteomics on 05-JAN-2005.
Task 1.4
Bioinformatics has taken off since the mid-1990s, when the first genome projects were
successfully completed, because of its rapid sequence analyses. Sequence comparison (for
1.3 Exercises for Chap. 1